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1 Introduction 

Does learning of non-native speech features lead to 
a reorganization of language-specific areas in the 
human brain? It is well known that we perceive and 
produce speech through the filter of our native lan¬ 
guage. While infants are able to discriminate most 
phonetic categories in the first months of life, they 
already develop prototypical phoneme representa¬ 
tions of their native language in the first years of life 
[1], Once established, these phoneme categories 
form a stable base for the consecutive speech per¬ 
ception and speech production [4], For instance, 
Finnish and Estonian adults perceive their native 
phoneme prototypes more sensitively than non¬ 
native phonemes [7], Japanese listeners have diffi¬ 
culties in distinguishing IV and /r/ and perceive both 
phonetic categories as one [2], However, it has been 
shown that an intensive frequency discrimi-nation 
training can enhance the representational areas of 
the trained frequency [6], It is assumed as well, that 
discrimination of non-native speech contrasts can be 
learned through intensive training [3,5], Winkler et 
al. [8] demonstrated that Hungari-ans, who learned 
in adulthood to speak fluently Finnish, also 
developed cortical memory representa-tions for Fin¬ 
nish phoneme categories, which do not exist in their 
native language. 

The accurate perception of non-native phonetic 
categories can be measured by means of an auto¬ 
matic, preattentive change detection component of 
the Auditory Evoked Response (AER) called the 
Mismatch Negativity (MMN) and its magnetic 
counterpart, the Mismatch Field (MMF) [7,8], The 
phonemic memory traces revealed by the 
MMN/MMF seem to be language specific, as there 
is an enhancement of this component for native but 
not for non-native phoneme categories. The aim of 
the present study was to examine if an intensive dis¬ 
crimination training of very small differences of 
non-native speech contrasts affects the discrimi- 
nability performance of these features and also af¬ 
fects the brain’s automatic change detection re¬ 
sponse as reflected by the MMF. 


2 Methods 

In the German language the duration of a phoneme 
is not a crucial discriminative feature. In other lang¬ 
uages the relative duration of a vowel or consonant 
can be sometimes the decisive feature for differenti¬ 
ating between meaningful words. For instance, in 
English, the difference between “heed” and “hid”, 
“bid” and “bead” or “wooed” and “wood” is en¬ 
coded only by the length of the vowel. In languages 
like Finnish or Hungarian every short vowel has a 
corresponding long vowel and every long vowel a 
short counterpart. In a similar way the duration of 
some consonants is used in a short and long variant. 
The Japanese language has a restricted amount of 
possible syllables. This led to a high amount of ho¬ 
monymous syllables in Japanese so that meaningful 
words are often differentiated only by the length of a 
vowel or consonant when context is not available. 
For the present study meaningful Japanese words 
were generated of natural spoken words at the De¬ 
partment of Speech and Cognitive Sciences of To¬ 
kyo, differing in 8 degrees of duration of a vowel or 
a consonant (Fig. 1). For measurements, the stimuli 
were lowpass-filtered at 4.5 kHz and the on- and 
offset of the vowel segment was corrected with a 
cosine-like slope to eliminate clicks and steep 
flanks. All stimuli parameters were kept constant 
except the length of a vowel or a consonant. The 
first (long) and the last (short) duration always 
formed one separate perceptual category for Japane¬ 
se listeners, corresponding to a meaningful word, 
but not for German listeners, who did not associate 
any meaning, neither with the long, nor with the 
short variant. All 8 difference stages of the pairs of 
stimuli (anni-ani, itte-ite, kiyo-kyo and kiyou-kyou) 
were presented to 10 native German subjects for 
discrimination training. Each training session was 
accompanied by a discrimination test without feed¬ 
back. Before and after the training period the neu- 
romagnetic responses to 3 different stages of conso¬ 
nant and vowel duration were recorded in a mag¬ 
netically shielded room with a 37-channel biomag¬ 
netometer (Magnes, BTi). 




2.1 Discrimination training 

Subjects were trained on 10 consecutive days for 1.5 
hours according to the stair-case method with visual 
feedback. Similar to the oddball paradigm in MEG 
measurements, a frequent standard stimulus pair was 
interspersed with rare deviant stimulus pairs. The 
standard stimulus pair consisted always of two equal 
stimuli with longest duration. The deviant stimulus 
pair consisted of the stimulus with longest duration 
and one of 7 stimuli with a shorter duration de¬ 
pending on the discrimination performance reached 
in the training. The order of occurrence was ran¬ 
domized. The training started at the highest dura¬ 
tional difference and continued with progressively 
smaller durational differences until the smallest dif¬ 
ference to the standard stimulus was reached. Each 
training performance was cross-validated with the 
performance of a discrimination test without feed¬ 
back (Fig. 2). 



Figure 1: Left: The duration of the consonants in 
/anni/ and /itte/ is varied in 8 steps of 15 ms. Right: 
The duration of the vowel /// in /kiyo/ and /kiyou/ is 
varied in 8 steps of 15 ms. In MEG measurements 
the difference stages 8, 6, and 4 of /anni/ and /kiyo/ 
were tested. In the discrimination training all 8 dif¬ 
ferences were trained and tested. 


After each training session subjects were reinforced 
by informing them about their best performance in 
that training. A small financial reward was an¬ 
nounced for the best performance in all trainings. 

2.2 MEG measurements 

For MEG recordings of the MMF an oddball para¬ 
digm was adopted. Each recording consisted of 2 
sessions, one session with 3 different stages of 
vowel duration (kiyo 8-6-4) and another session 
with 3 different stages of consonant duration (anni 
8-6-4). For each condition 3 runs were grand-aver¬ 
aged. The deviant probability was set to 15% with a 
randomized ISI of 900 ±100 ms. The intensity of 
each stimulus was set to 70 dB SL above each sub¬ 
ject's measured detection threshold. Recordings 
were carried out from the left auditory cortex while 
stimulated contralaterally. 


3 Results 


3.1 Discrimination training 



Difference stages (1-8) 


Figure 2: Hitrates for each of 8 difference stages in 
every third training with feedback. The discrimina¬ 
tion performance increases exponentially in the 
small difference stages until the end of training. The 
high differences are differentiated easily at an early 
time point whereas the performance in small differ¬ 
ences increases until the end of training. 


Improvements in the training occurred mainly in the 
small difference stages 2-4 until the end of training. 
In large difference stages 5-8 the best performance 
was reached early and maintained until the end of 
training (Fig. 2). These findings suggest that there is 
a categorical boundary at difference stages 4 and 5 
between the long and the short version of the stim¬ 
uli. These results are corroborated by the perfor¬ 
mance obtained in tests without feedback before and 
after training sessions. 



























































































































































3.2 MEG measurements 



Figure 3: From left to right: Grand-averaged MMF 
waveforms from the largest to the smallest differ¬ 
ence for /anni/ (top) and /kiyo/ (down) demonstrated 
on a single subjects data. 

For evaluation of the MMF the grand-averaged re¬ 
sponses to the standards were subtracted from 
grand-averaged responses to deviants. A source 
analysis based on a single moving dipole model in a 
sphere was performed. The coordinates estimated 
here were used to calculate the source strength in a 
fixed dipole model. 

The MMF for /anni/ is recorded from 370 to 450 ms 
after stimulus onset and from 170 to 250 ms for 
/kiyo/ (in both cases 150-230 ms after the point of 
uniqueness of the stimuli). An increase of the am¬ 
plitudes of the MMF after the training is demon¬ 
strated for a single subject in Fig. 3. In grand-aver¬ 
ages of all subjects (Fig. 4) a significant increase of 
the amplitudes of the MMF was found after training 
for anni8 (p<0.05). For anni6 and anni4 the diff¬ 
erences were not significant, but for kiyo8 and kiyo6 
the differences were highly significant (p<0.01), 
whereas for kiyo4 the difference tended to signifi¬ 
cance. In the conditions with the large differences 
(anni8 and kiyo8) a slight decrement of the latency 
was also observed. The increment in the responses 
to duration differences in vowels showed a clearer 
and profounder MMF as to consonant differences 
(Fig. 4). 
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Figure 4: From left to right: Grand-average of the 
RMS values (mean, s.e.m.) of all subjects from the 
largest to the smallest difference (8-6-4) for the con¬ 
sonant condition in /anni/ (left) and the vowel con¬ 
dition in /kiyo/ (right). 


4 Discussion 

The obtained results indicate that phonemes differ¬ 
ing only in small durational differences are learned 
to be discriminated after relatively short training 
time and a performance improvement up to 120% is 
reached in the smallest difference after the last 
training compared to the performance before the 
first training. A similar effect occurs on the neuro¬ 
physiologic level. The preattentively processed pho¬ 
neme contrasts, as reflected by the MMF, appear to 
require more of the neuronal resources after the 
training. Even small differences are perceived easier 
and result in a higher activation of the underlying 
neuronal networks. Speech-related plasticity seems 
to occur as a consequence of discrimination learn¬ 
ing. This implies that the cortical representations of 
language features can be remodeled dynamically by 
intensive behavioral training even in adulthood. The 
effects of amplitude increment and latency 
decrement is similar to frequency discrimination 
learning as reported earlier [6], Intensive training 
improved not only the discrimination performance 
of the subjects but changed also the neural 
representation of speech sounds as reflected by the 
automatic change detection response, the MMF. 
Assumed that the different length of phonemes is 
learned to be perceived as different phonemic cate¬ 
gories by the German listeners, the shift of the cate¬ 
gory boundary to smaller differences achieved in the 
training can be hypothesized to be due to phonemic 
processing. Further studies have to investigate if this 
processing differs substantially from the processing 
of durational differences in native language or non¬ 
speech material with the same parametric character¬ 
istics. 
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